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ABSTBACT 

In this study, tuo rival accounts of the mental 
operations used to solve Piaget's three-mountain perspective task are 
tested. One hypothesis is that if children use some form of mental 
rotation through anticipatory imagery, scores should improve as tl;e 
angle of separation between the child and the other viewer is 
decreased. A second hypothesis is that if children simply construct a 
linguistic description of the view adjacent to the other viewer, 
scores should instead reflect the complexity of the linguistic 
description of the ot)ier"s view. In addition, if a linguistic 
strategy is used, a linguistic response mode should optimize correct 
responses; if an imagery solution is employed, a non-linguistic 
response mode would presumably be more appropriate. A total of 120 
children in grades K, 2 and 4 were asked to identify another's view 
either verbally or by picture selection. Besults indicate that th§ 
verbal response mode leads to substantially more correct responses 
(82% vs. and has a minimum of egocentric errors (4% vs. 33%) • 

Correct responses in both verbal and pictorial modes are shown to 
increase as a function of the linguistic complexity, not the angle of 
separation. It is suggested that the egocentric perspective errors 
noted by Piaget can be seen as the by-product of a non^linguistic 
response mode which does not map onto the linguistic mental 
operations typically used to solve the task. Spatial eqocentrism 
emerges as a function of one particular mode of response (pictorial) 
rather than as a general characteristic of pre-*operational thought. 
Analogously, perspective taking seems better described as the 
handling of increasingly complex linguistic descriptions rather ti^an 
as a simple '^present/absent" phenomenon. (Author/HS) 



Documents acquired by ERIC include mai-y infonnal unpubUshed materials not available from other sources. ERIC makes every 
effort to obtain the best copy avaUable. Nevertheless, items of marginal reproducibiUty are often encountered and this affects the 
quality of the microfiche and hardcopy reproductions ERIC makes avaUable via the ERIC Document Reproduction Service (EDRS). 
^Qr^<^ js not responsible for the quality of the original document. Reproductions suppHed by EDRS cjre the best that can be made from 
iginaL 



ERIC 



f 



oo 



U.S. DEPARTMENTOF HEALTH. 
EDUCATION 4 WELFARE 
NATIONAL iNSTITUTE OP 
EDUCATION 



N I"'^ DOCUMENT MAS 'SEEN REPRO- 

^— > - ?^CED EXACTLY AS RECEIVED FROW 

^ PERSON OR ORGANIZATION ORIOJN- 

fJi'iS'^ POfNTS OF VIEW OR OPINIONS 
<Itll^P° NECESSARILY PIEPRE. 

SENTOFFIOAL NATIONAL INSTITUTE OF 



EDUCATION POSITION OR POLICY 



Children's Ability to Coordinate Spatial 
Perspectives Through Linguistic Descriptions^ 

S. William Ives 
Graduate School of Education, Harvard University 



CO 



o 

CO 



ERIC 



Paper presented at the biannual meeting of the Society for Research 
in Child Development, New Orleans, Louisiana, March 1977. 



Footnote to cover sheet : 

The research reported in this article is part of a Ph.D. 
dissertation submitted to the University of Toronto, 1976. The 
preparation of the article was supported in part by a National 
Institute of Education Grant (G-00-3-0169) to Harvard Project Zero. 
An extended version of this paper will appear in D.R. Olson and E. 
Bialystock (Eds.), Explorations in Inner Space; Aspects of the 
Nature an d Development of Spatial Cognition (ia preparation) • ■ 



Introduction 

Within experliiiental psychology one can discern a division between 
two broad categories which has traditionally been inade either implicitly 
or explicitly in most research dealing with cognitive processes.. This 
dichotcny is generally characterized as a distinction between visual- 
spatial processes and verbal-semantic processes. Psychologists have 
generally ass^jmed that itens in their intelligence tests and experiii>pntal 
tasks say something about these verbal and spatial processes and that 
the symbolic mode in which a problem is couched (linguistic or pic- 
torial) determines the processes used to solve the problem. Thus what 
is called verbal reasoning is tested through verbal means and visual- 
spatial skills are tested through pictorial means. 

IJecent research, however* has called these assumptions into question. 
For example, on the one hand, Kossljm (1976) has shown that a verbally 
presented problem such as the question "Does a cat have claws?" can be 
solved through the prepositional > associative processes gener.vj.ly called 
verbal reasoning or through the construction of a spatial image which 
has the characteristics of a continuous , analog representation of the 
problem. On.ciie other hand, David Olson and others (i.e. , Olson, 1973; 
Scher, 1976) have shown that certain visually presented spatial problems 
are solved through structural, linguistic-like descriptions such as 
"up to the right" while Roger Shepard and others (i.e., Shepard & >fetzler, 
1971; Marmor, 1975), have shown that other spatial probleniB are solved 
through the use of continuous, analog representations. Thus the fact 
that a problem has been presented ±n ^ certain medium of form does not 
insure that the mental operations used by the individual will honor 
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the verbal or spatial medium. Nor does it appear that the. dichotoiny 
between verbal aad spatial processes captures the distinctive operations 
involved in problem solving. Certain verbal and spatial problems can 
be splyed through the use of continuous, analog reprtisentations while 
others are solved through structural, linguistic descriptions. Moreover, 
it appears likely that some verbal and spatial problems can be solved 
through either continuous or structural representations CCollins & 
Quillian, 1972; Jorgensen & Kintch, 1973; Olson, 1973; Quinton & Fellows, 
1975) and that different individuals might approach the same problem 
through different mental operations (Gardner^ 1977; Gardner, Wolf, & 
Smith, 1975; Kosslyn, 1976; Quinton & Fellows, 1975; Wolf & Gardner, 
1977) ► To further clarify some of these issues this paper will explore 
one of the classic exartiples of spatial problem solving, Piaget^s three 
mountain task (Piaget & Inhelder, 1956). The present investigation of 
spatial perspective taking will foctis on both the role of symbolic re*- 
presentation in problem solving and the mental operations underlying the 
xChild^s solution to the task. 

Spatial* Perspective Taking 
In one of the most cited experiments in developmental psychology, 
Piaget and Inhelder (1956) presented children ages 4 through 12 with ah 
array of three mountains in order to test their ability to coordinate 
spatial perspectives. Children younger- than 7 tended to choose the picture 
identical to their own view rather than that of the other viewer. 
Piaget and Inhelder attributed this failure to distinguish between view- 
points to the child ^s egocentrism; it was not until age 9 or 10 that the 



child could successfully coordinate the perspectives involved in the 
three mountain task. 

Subsequent research, however, has shown that with modifications in 
the experimental design children as young as two and a half can correctly 
choose another's view (Masangkay, McCluskey, Mclntyre, Sims-Knight, 
Vaughn, & Elavell, 1974) and the number of egocentric errors can be 
greatly reduced (Borke, 1975; Keilgast , 1971), thus indicating that 
the ability to coordinate spatial perspectives is oot straply a present/ 
absent plienoraenon. However, the exact relationship between the initial 
abilities of young children to solve simple perspective tasks and the 
skills required to solve the more difficult three iwjuntain task remains 
to be formulated since an understanding of what constitutes complexity 
in a perspective task requires a knowledge of the mderlying mental 
operations involved in Its solution. 

Few of the many perspective studies have dealt with these under- 
lying mental operations and in those which do it has sin5}ly been assmed 
that some form of mental imagery is involved in the solution (De Lisi, 
Locker, & Youniss, 1976; Harris & Bassett, 1976; Huttenlocher & Presson, 
1973; Nigl & Fishbein, 1974). However, two rival accounts can be 
hypothesized. The child can possibly use some for of mental rotation 
by the means of anticipatory imagery and continuous representations. 
If so, then some form of analog relationship or "second order iso- 
morphism" (i.e., Shepard, Kilpatric, & Cunningham, 1975; Shepard 
>fetzler, 1971) should occur between the internal mental operations and 
the external act of taking another's spatial view as has been shown in 
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an intreaaing amount of research Involving mental imagery. Therefore, 
perspective taki^g scotes should iii5)rove as the angle of separation or 
the rotational distance between thi^ c?iild and the other viewer is 
decreased* s 

If> on the othfer hand, children ><?iinply construct a Structural, 
linguistic description of the"^ view in' front of the other viewer^ scores 
should reflect not the rotational distance but rather the complexity of 
the structural, description of the other's view. In this account the 
"side" or "bacfc^' of a car, for example, should be easier to indicate 
than the front comer which requires a more complex description. Further 
more, referent objects which contain an inherent "side" or "back" such 
as a car should simplify the assignment of a linguistic description. 
Referent displays like the three mountains xAich have no inherent struc- 
tural descriptions need to be described in terms of the relationship 
between objects such as "the red mountain is in front of the yellow 
mountain" and therefore will add more coirplexlty to the task. In 
addltioa, if a structural, linguistic description is used in the 
perspective tasrk, a llnguisticr response mode should optimize correct 
responses; if an anticipatory imagery solution is employed, a non- 
linguistic > continuous response mode such as pictures would presumably 
be more appropriate. 

In order to test these competing accounts the following experiment 
was conducted. 

Experimental Procedures 

Participants 

One hundred and twenty public school children attending Kindergarten , 

rr 
I 

EKLC 



5 



Grade Two and Four .in Scarborough, Ontario were individually tested. 
Materials 

Four toy objects with inherent structural descriptions (car,. man,s> 
house, horse) hereaftet called canonical objiects, and a two-thirds 
replica of Laurendeau and Pinard's (1970) three mountain display 
were used as the referents in the task. A black 35 mm. camera was used 
to mark the "other" viewpoint in all instances. 
P rocedures 

Twenty children per grade were tested on their ability to indicate 
another ^s view in each of two response conditions: picture selection 
and verbal response. In a practice trial using a toy boat and again 
with each referent children were trained on either the correspondence 
of the pictures with their appropriate orientation or the verbal label 
for each possible correct orientation. The verbal labels for the 
three mountain display were: "red in front of yellow," etc. Ten 
basic combinations of object orientation and camera placement (here- 
after simply called orientation) were chosen to allow the camera to be 
placed in positions of 45**, 90"^, 135**, and 180° from the child's position 
and to also allow the correct linguistic response to be the front, back, 
side, front comer, and back comer orientations for each referent object. 
The camera placement and the orientation of the object were manipulated 
independently and each cliild received five of the ten orientations for 
each of the five referents in a counterbalanced manner. 

Results and Discussion 
The Three Mountain Display Condition . A summary of the findings from 
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the three nibuntata ccjadltion is presented ±n this first slide (Table 1). 
While the results from the picture selection condition include more 
correct responses than fomd in Laurendeau and Pinard^s tasks from which 
the stiiailus was taken because of such factors as the practice trial, 
the basic trend of the correct responses and egocentric errors is 
replicated. However, while the picture selection condition has only 
a 38% correct response rate across grade levels, 79% of the verbal 
responses are correct (F(i^i8)-120.93,.«p <.001). The response condition 
effect is highly significant and it accounts for 62% of the variance. 
In addition, there is a 32% egocentric response rate across grade levels 
in the picture selection condition but only a 10% egocentric rate in 
the verbal condition, again a highly significant difference (F(i ,18)='75 .28, 
p <.001) . 

As slidejl (Table 1) indicates the correct ...scores of children in 
both response conditions increase with grade level (F(2, 18)^^0. 18, p <.001) 
in a significant linear trend (Fiin(l,18)'^^0 »33 ^ p < .001) and grade 
level accounts for 20% of the variance. The orientation main effect for 
correct responses is not significant (p>.10) since the number of correct 
responses does not vary as the result of a change in the rotational 
distance nor is there any significant variation as a result of a change 
in the description of the other^s view« However, within the three 
mountain condition there is no variation in the linguistic complexity of 
each correct description. 

The three mountain results thus imply that children generally solve 
the task by assigning structural, linguistic descriptions to the array 
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rather than by mentally rotating the array. Spatial ego cent r ism 
appears to be related to one particular node of response (the pictorial) 
rather than to a general characteristic of p re-ope rational thought. 
Stronger evidence to support these contentions is seen in the results 
of the canonical object condition. 

The Canonical Object Condition . A second slide (Table 2) shows fehe 
canonical object results. The object effect was insignificant in all 
analyses (p>.10) so that all generalizations will be made across objects. 
The grade level effect is again significant (1(2^54)= 132.76, p < .001) 
primarily reflecting a linear increase of correct responses with in- 
creasing grade level (liin(i,54)=264.53, p <.001). In the verbal fe- 
sponse condition 83% of the total, responses are correct while only 47% 
are correct in the picture selection condition and the significant re- 
sponse condition effect (1(1,54) =594. 12, P <-001) accounts for 41% of 
the variance. Only 3% of the verbal responses are egoeeixttic while 40Z 
of the picture selection responses are egocentric with' the significant 
response condition effect (F(i,54)=721.46, p < .001) accounting for 58% 
of the variance in the analysts of egocentric errors. There is no 
systematic decrease in correct responses with an increase in the rotational 
distance as might be predicted by an anticipatory imagery solution. 

.On the other hand, as the second slide shows, the linguistic complexity 
of the correct response does have an effect on correct responses and a 
planned comparison indicates that this difference is significant 
^^simple vs. coii5)lex(l,54)^-^^^*^^» P <-001). Further conparisons of 
correct responses also demonstrate that the conplexity of the correct 
linguistic description affects the child's ability to solve the per- 
spective task but time does not permit conplete coverage. 
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A detailed analysis, of the error patterns also indicates that 
the cojrplexlty of the structural description influences the type of 
response iand that each node of response has its own characteristic error 
pattern. This is particularly evident in the kindergarten responses as 
seen in Table 2. In both response conditions orientations with conp lex 
structural descriptions result more in correct responses. However, each 
mode of response has its own error pattern. Picture selection results 
in more ei^ocentric errors (53% vs. B%) aad verbal responses result in 
more non-ego centirlc errors (50% vs. 25%). When the correct orientation 
requires a two-part linguistic label (for example, "front and side" 
which was the term used for front comer) kindergarten children tended 
to leave off one component of this description. For example, instead 
of saying "front and side" the child would simply respond with "front" 
or "side" and this type of error accounts for 96% of the non-egocentric 
kindergarten errors in the verbal condition seen in the second slide • 
(Table 2). 

We are now able to see that development in perspective taking can 
be described as the ability to handle increasingly more complex struc- 
tural descriptions of the other's viewpoint. Conq^lexity in perspective 
taking can now be defined as a function of the features of a referent, 
that is, whether or not is has inherent structural descriptions and 
as a function of the type of relationship between the referent and the 
other viewer, that is, whether or not the other's view maps directly 
on to one of the inherent structural descriptions. The results of the 
canonical object condition also confirm the finding of the three mountain 
condition that spatial egocentrism in perspective taking appears related 
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to the pictorial mode of response rather than to a general characteristic 
of pre-operational thought and the results also support the contention 
(Olson, 1976) that each means of response has its own characteristic 
error pattern. 

Conclusi on 

The restilts reported in this paper clearly support the position that 
difficulty in the perspective task is a function of the^complexLty of 
the structural description assigned to the other^s view and the degree 
to which the means of response maps on to those structural descriptions. 
However, I do not wish to convey the impression that the solution to 
all spatial problems lies simply in an ability to verbalize or that language 
is the key to all understanding. The use of spatial imagery through 
continuous, analog representations i^ -m^'M documented by Roger Shepard 
and others (Kosslyn and Pomerantz, 1977). What does seem to be emerging 
in recent research is the finding that many spatial problems which 
were traditionally assumed to be solved through anticipatory imagery and 
continuous analog representations are actually organized and solved 
through structural descriptions. Jloreover, it seems that the dichotomy 
between ve ^al and spatial might more accurately be characterized as 
a distinction between the use of continuous and/or structural descriptions 
for both verbal and spatial problems. As Olson and others have stated, 
the use of structural descriptions cuts across verbal and visual repre- 
sentations and, taken together, the work of Kosslyn, Shepard and others 
indicates that the use of continuous representations can occur in both 
verbal and visual presentatiott^". In addition Marmor and Zaback (1976) 
have shown that congenitally blind adults solve tactilely presented i 
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10 

rotation problems using the same type of continuous, analog representa- 
tions as sighted adults thus providing evidence that mental rotation is 
not dependent on visual imagery but rather the use of continuous re- 
presentations. 

Some problems such as mentally rotating a spatial array (Shepard 
& >tetzler, 1971) or unfolding a cube (Shepard & Feng, 1972) seem par- 
ticularly suited for continuous representations. Others such as determining 
another's perspective or comparing the orientation of two lines (Olson, 
1973) are particularly suited for structural descriptions. A third set 
of problems such as determining the truth of descriptive statements 
(Kosslyn, 1976), solving analogies (Olson, 1973) or three-term series 
problems (Quinton & Fellows, 1975), understanding a poem (Gardner, 1977), 
organizing verbal narratives, constructing maps, and dealing with certain 
mathematical principles are perhaps subject to solution through either 
continuous or structural descriptions. It may, in fact, be that most 
problems fall on a continuum as to the applicability 6f continuous or 
structural descriptions. In addition, it may be that individuals also 
fall somewhere on a continuum *as to their preference a:ad skill in using 
continuous or structural descriptions when approaching problems. 

If the above speculations prove correct, then the task of the 
psychologist and the educator is to move beyond the visual-verbal dichotomy, 
determine the skills actually necessary to solve particular tasks, and 
devise training strategies to fit both the nature of the task and the 
preference and skills of individuals. Children who lack the intuitive 
ability to efficiently utilize either structural or continuous representa- 
tions could be trained directly In their deficient skill as Salomon (1974) 
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has done with film modelling of "zooming in" on details. Moreover, de- 
pending on the requirements of the task, children could be shown how to 
use their preferred skills to reach a solution In an area in yAilch they 
were formerly deficient. If children of low spatial ability can be taught 
to utilize structural descriptions in the organization of space and 
children of low^verbal ability can be taught to utilize continuous 
representations In the organization of verbal material, then their 
particular cognitive strengths might not be a limiting factor in the range 
of problems they can master and subsequently in the occupations in which 
they are potentially skillful. 
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EXPERIMENT ONE : 
KESULTS OF THE HRSPECTIVE TASK USING THE THHEE MOUNTAIN DISPLAY 
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EXPERIMTONE 
RESULTS OF THE PERSPECTIVE TASK USING WICAL OBJECTS, 
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